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We present an algorithm for deriving partonic flavour labels to be applied to truth particle 
jets in Monte Carlo event simulations. The inputs to this approach are final pre-hadronisation 
partons, to remove dependence on unphysical details such as the order of matrix element 
calculation and shower generator frame recoil treatment. These are clustered using standard jet 
algorithms, modified to restrict the allowed pseudojet combinations to those in which tracked 
flavour labels are consistent with QCD and QED Feynman rules. The resulting algorithm is 
shown to be portable between the major families of shower generators, and largely insensitive 
to many possible systematic variations: it hence offers significant advantages over existing ad 
hoc labelling schemes. TTowever, it is shown that contamination from multi-parton scattering 
simulations can disrupt the labelling results. Suggestions are made for further extension to 
incorporate more detailed QCD splitting function kinematics, robustness improvements, and 
potential uses for truth-level physics object definitions and tagging. 


Introduction 

The rise of jet substructure methods at the LHC has prompted a resurgence in attempts to 
distinguish "quark" and "gluon" hadronic jets from each other, primarily for use in searches for 
BSM phenomena. Such attempts are primarily based on the different colour charges of quarks 
and gluons, with the larger Ca colour factor of the gluon associated with more jet broadening 
and higher constituent multiplicities than the quarks' Cp factor. 

A conceptual problem immediately arises in that colour-neutral jets cannot perfectly corre¬ 
spond to coloured single partons. Additionally, final-state observables do not provide unam¬ 
biguous evidence tor two distinct statistical populations of hadronic jets. The evaluation oi q/g 
jet tagging methods has hence been based on use of event generators' partonic event graphs to 
define the "true" jet label, typically assigning a partonic identity to each final-state truth-jet by 
the flavour of the highest-energy or highest-p 7 parton found within a fixed angular distance of 
its centroid. Sometimes these labelling partons are chosen from the entire partonic event record. 
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including parton shower (PS) evolution; sometimes they are restricted to partons associated to 
the hard process matrix element (ME). The labels derived using these methods are then used to 
determine the efficiencies and fake rates of experimental tagging techniques. Such approaches to 
truth labelling have several practical and conceptual drawbacks: 

• their reliance on the unguaranteed physicality of partonic event records - these are typically 
intended more for debugging than for robust physics analysis use, and their momenta may 
not have physical meaning; 

• even where the parton momenta are physical in their chosen frame, it is often the case that 
different generations of parton evolution are represented in frames different from the lab 
frame relevant to final state observables; 

• the potential unphysical distinction between "matrix element" and "parton shower" par- 
tons - problematic for consistency of labelling at different perturbative orders and par¬ 
ticularly for "resummation-corrected" matrix elements such as those in the POWHEG 
method lUl where there is no clear kinematic distinction between ME and PS emissions. 

All of these limitations and assumptions cause problems in practice, notably in the inability to 
use the Sherpa event generator IJI whose event record is complexified by the use of matrix- 
element/part on-shower merging and matching (MEPS) and a dipole shower formalism with 
2—^3 parton branchings 0. More traditional parton showers, i.e. the 1 —> 2 parton splitting 
formulations used in the Pythia |31|5j|6l and Herwig generator families BZlIHl, are themselves 
problematic due to the need for momentum reshuffling to preserve Lorentz invariance in PS 
evolution: their off-shell intermediate parton representations may not have physically reasonable 
four-momenta, nor even be represented in a well-defined reference frame. Practically, even if 
physical information could in principle be obtained for every parton in a given MC genera¬ 
tor's event record, it would be extremely inconvenient to require separate algorithms for each 
generator's event records. 

In this paper we propose and characterise an alternative truth-jet labelling method, based on 
standard jet clustering algorithms and measures, modified to only permit clusterings compatible 
with 1 —> 2 QCD and QED processes. This has been implemented in the Pastjet framework for 
three standard jet measures, and we present studies of the performance, robustness and porta¬ 
bility of the resulting labels across physics processes, perturbative orders, PS event generators, 
distance measures, and soft physics effects. 

The primary motivation for this work is to provide an operational definition without the 
practical problems of ill-defined clustering inputs and generator-incompatibility which plague 
existing truth labelling schemes. However, we note the close relation of this scheme to prior 
work on infrared safety in partonic jet definition ||9| and discuss possibilities for extension of 
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this scheme both for improved perturbative safety and for use in fagging beyond fhe regime of 
Sudakov emission kinemafics. 


1 QCD-aware parton clustering 

The "QCD-aware" clustering algorithm which we will characterise in this paper is directly 
intended to address the limitations enumerated in the previous section. Our priority is to obtain 
a robust truth-jet labelling scheme usable with all event generator codes, and based on physically 
well-defined partons, and not at present the issues of QCD singularities focused on by the related 
Ravour-kj algorithm ||9|. 

1.1 Jet clustering algorithms 

QCD-aware clustering is, like flavour-fcx/ a modification of the well-established family of 
agglomerative jet clusterings, developed as an infrared & collinear safe alternative to cone-based 
jet finding |[T0]| . These operate by clustering an initial population of would-be jet constituents 
(particles, or experimental objects such as calorimeter cell clusters), known as pseudojets, into 
final jefs by repealed 2—^1 combinations of pseudojefs. Af each iterafion of the algorithm, the 
pair of pseudojets to be combined is that which minimises a distance measure containing energy 
and angular terms. The distance measure for fhe k^ family is 

d\f = min (kl\, kf^ ARfj/R^ (1) 

where i and j are fhe pseudojef indices, ARp is fhe beamline-boosf-invarianf disfance befween 
them in p-cp or y-(p space, R is a parameter defining the characteristic radius of the resulting 
jets, and the choice of n exponenf chooses whefher fhe algorifhm is "kj" (n = 1), "Cambridge- 
Aachen (C/A)"(n = 0), or "antb/cx" {n = — 1)[^ A pseudojet i is considered "final" and removed 
from the clustering if ifs nearesf disfance fo anofher pseudojef is greafer fhan k^-, fhe so-called 
beam disfance. Clusfering slops when no pseudojefs remain. 

The formal origins of fhis family of clustering distances lie in the form of fhe measure, which 
for the original kj measure is proportional to both the smallest transverse momentum and 
to AR. This focus on the low-kj and small emission angle regions ensures resummation of 
both the collinear and soft divergences in the QCD gluon emission splitting function, and kj 
clustering is often referred fo as an inversion of fhe QCD emission sequence of a parfon shower 
or resummafionl^ The lafer C/A and anfi-kx algorifhms only address fhe collinear divergence, 

^ Other, potentially non-integer, values could also be used but have received little attention since the sign of n is 
more important than its absolute value. 

^The aim of fhe flavour-fcx algorifhm was fo make fhis inverse relationship more precise, by using a disfincf 
disfance measure for fhe purely collinear divergence of fhe g ^ cjp parfon splitting. 
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Figure 1: Feynman rule vertices used for QCD (and QED) aware jet clustering. 

but due to its production of circular jets the latter has become the standard jet type used at 
the LHC. The Fastjet [[TT| package contains several important optimisations for the family, 
making use of geometric properties of the clustering measures to ameliorate the naive cubic 
dependence on the number of initial pseudojets. 


1.2 Flavour-aware clustering 

The relationship between jet clustering and dominant QCD emission kinematics is central to 
the QCD-aware approach to truth jet labelling. The "first" partons connected to a MC generator 
signal process suffer from an unphysical distinction between matrix element and shower emis¬ 
sions, as well as uncertainty over the physicality of their momenta, but inversion of the emission 
sequence starting from more physical final partons should in principle be well-behaved. 

We hence propose a minor modification to the above family of clustering algorithms, in which 
the measure is adapted to ensure that only clusterings compatible with the 3-point Feynman 
rules of QCD and QED will be granted a finite distance. The vertices corresponding to these 
Feynman rules are shown in Figure]^ Note that charged leptons and photons are also included. 
The "QCD-aware" distance measure is hence 



if flavours QCD/QED compatible, 
oo otherwise. 


( 2 ) 


As the effect is to veto flavour-incompatible clusterings which would take place in a standard 
flavour-blind jet algorithm, all the Eastjet performance optimisations for the ' family could 
also apply to the QCD-aware variants of those algorithms. 

Eor this to work, each pseudojet must carry an extra fxmdamental particle flavour label in 
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addition to the usual kinematic information, and this label must be set on the resulting pseudojet 
from each 2 —> 1 merging, as required by consistency with the corresponding Feynman rule. 
This measure modification and relabelling algorithm are implemented in the QCDAware Fastjet 
plugin class. 

It is important to note some simplifications in our scheme. First, we have ignored the 4-gluon 
vertex since it cannot be mapped to a 2 —> 1 clustering and has a small splitting fimction. 
Secondly, we do not permit ambiguity: a qq clustering could in principle resolve to either a gluon 
(as implemented) or a photonj^ One could consider both "histories" to be valid, assign weights 
according to computed relative probabilities, and eventually marginalise labelling results across 
the weighted ensemble of potential histories: this would be an approach along the lines of the 
shower deconstruction method |[T2|. Instead, for definiteness and computational tractability, we 
ignore the QED solution in favour of the much stronger strong coupling. Finally, we use a bare 
kj measure where the flavours are consistent, with no weighting for distinct splitting kinematics 
cf. ([avour-kj, nor for the relative strengths of (rurming) and aggo or the relevant charges of 
the participating particles. This last set of points is certainly worthy of consideration, but as we 
shall see even the bare algorithm exhibits interesting and useful features. 

Finally we highlight a somewhat obvious property of flavour-aware clustering which is useful 
to bear in mind: that the majority of emissions consist of emission of the gauge boson, i.e. the 
gluon or photon for strong and EM radiation respectively, and that this produces no flavour 
change. We would expect that it would be relatively hard for a quark jet to lose its quark label 
(this would require an accidental clustering with an antiquark of the same flavour), while a true 
gluon jet can be mislabelled as a quark jet by the accidental presence of a single quark within its 
capture radius. (The same applies to should-be isolated photons contaminated by the proximity 
of a quark or charged lepton.) We will consider this effect later, particularly when studying 
systematic variations on MC generator configurations. 

2 Selection of clustering inputs 

Before the "QCD-aware" clustering rules can be used to generate labelling parton-jets, we must 
identify the partons, photons, and leptons to be clustered. 

Only the final quarks and gluons — those immediately before hadronisation — are used 
as inputs to the clustering. This avoids double-coimting, since the partons in question are 
guaranteed not to have undergone any further splittings or radiation, and we have verified that 
the three main families of parton shower MC generators record the momenta of these partons 
in the lab frame|^ The representation of final partons is not uniform between generators, so 

^For now we also do not consider the —>• 7 inverted photon conversion as an allowed clustering. 

^This is not a standard imposed by the HepMC IflSl event record standard, but perhaps it should be. 
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we apply a two-step heuristic to identify them; first we accept a parton if it is incoming to a 
vertex with status/id = 5, the standard code for a hadronisation vertex; and secondly if the first 
condition is not met, the particle is accepted if it has no children which are also partons. The 
first condition is preferable, but hadronisation vertex labelling is currently only implemented in 
Sherpa, hence the second heuristic is required to identify final partons in Herwig++ and Pythia 8. 

Photons, electrons, muons, and hadronically-decaying taus that have no hadronic ancestors 
("prompt") are included in the clustering inputs. The "prompt" restriction is because the four- 
momentum associated with non-prompt photons and leptons is already included in the momenta 
of the final partons whose descendent hadrons decay to them. The restriction to hadronic taus is 
for alignment with experimental procedure: taus that decay to leptons are generally reconstructed 
by experiments as charged leptons and missing energy rather than as jets, with such charged 
leptons from prompt taus being themselves classified as prompt. 

An implementation of this input-selection algorithm is available in the FinaiPartons and 
TauFinder projections in Rivet lfT4l from version 2.2.1 onwards. 

3 Associating labels to jets 

After applying the QCD-aware clustering to the partonic (and prompt lepton and photon) inputs 
discussed above, and normal flavour-blind clustering to the final state truth particles, we have 
two distinct jet collections: flavoured partonic label-jets, and standard particle jets. We aim to 
label the latter using the former. 

Arguably the simplest labelling algorithm is to assign each particle jet the label of the closest 
parton jet, i.e. that which minimises ARjet-iabel- This has the drawback, however, that distinct 
particle jets can share the same closest parton jet: should the particle jets share the same label, 
or should some additional matching criterion be introduced to assign the parton jet to just one 
particle jet, e.g. the nearer of the two? 

In this study we have hence used the ghost association |T5| method to non-invasively cluster 
the parton jets into the particle jets, guaranteeing that no parton label will be associated to 
multiple particle jet^ 

A second ambiguity now arises, because more than one parton jet can be ghost-associated to a 
given particle jet. Since the QCD-aware clustering forbids combination of some parton flavours, 
having multiple unclustered partons within a particle jet's clustering radius is a fairly frequent 
occurrence - moreso than the many-particle-to-one-parton ambiguity that ghost association 
resolves. Hence a disambiguation measure is required among the associated parton jets, and 
for simplicity we have chosen the label which minimises ARje^.j^bel/ within an inner core of the 

^Note that this is for the purposes of definiteness more than absolute physical correctness: in such ambiguous 
circumstances there is no guarantee that ghost association has picked the physically "correct" assignment, or 
that such a thing even exists. 
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jet radius: if all AR > 0.2, the jet remains unlabelled. This restriction to AR < 0.2 was added 
to remove long, low rate tails observed in initial runs of the algorithm. This may certainly be 
improved, and we suggest either a combined AR and pj matching measure (although this is a 
little like adding apples and oranges), to favour high-px or well-matched pj labels within the 
jet cone, and/or to assign weights rather than absolute labels - but we do not consider such 
extensions in this paper. 

4 Performance of QCD-aware labelling 

In this section we will make performance comparisons of the above-described labelling algorithm 
for two hard processes, dijet and 7 -jet, with various parton shower event generators, and with 
several systematic variations to both the labelling scheme and to the simulation: 

Shower generators: Pythia 8.201, Herwig++ 2.7.1, Sherpa 2.1.1 

Clustering variants: max-px (no clustering) / kj QCD-aware / anti-fcx QCD-aware 

Simulation variants: normal / without MPl / raised shower cutoff / ME max multiplicity 

Association variants: all labels / reclustered labels 

A minimum pj requirement of 25 GeV has been imposed on the particle jets in these studies, 
and a px > 5 GeV requirement on the partonic label jets. Both types of jets were clustered with 
an R parameter of 0.4. All jet clustering was restricted to within \f]\ < 2.5. 

For comparison to the QGD-aware approach, we will present a "maximum px" partonic jet 
labelling scheme, where the label assigned to a final-state truth jet is the flavour of the highest-px 
parton within its radius. This label is discovered by looping over all partons, including those in 
the hard process final state (typically in the partonic centre-of-mass frame of the matrix element 
calculation), through all the intermediate stages of the parton shower and MPl, down to the final 
partons described in Section]^ on page|^ Since the highest-px parton is used, this tends to be 
from the hard process or shortly after, before it has radiated significant virtuality via shower 
branchings. The measures that we use for labelling kinematic performance would be biased in 
this scheme, hence we will only show it in direct comparisons either of label assignments or in 
ratios of flavour label rates. 

4.1 MC generator families 

A key motivation for the QGD-aware approach to partonic truth-jet labelling is for the method 
to be portable between different MG generators. Each plot in most of the following studies is 
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hence shown with three MC lines, for the three major parton shower MC generator families; the 
exact versions are given above. 

In principle, the QCD-aware method should be robust enough for use both with fixed-order 
codes (producing a few-body partonic final state) and parton shower codes in which the final- 
state partonic multiplicity is much higher. Substantial differences between the Herwig and 
Pythia generator families have been seen in q/g rate prediction studies using the max-pj 
labelling scheme so some level of variation is to be expected between generator shower 

formalisms, but we expect broad qualitative agreement of labelling both between generators 
and with the expectations for each hard process type. 

In particular we are interested to see how the Sherpa generator compares to the Herwig and 
Pythia families, since it has not previously been possible to include Sherpa in partonic labelling 
studies. 

In all the plots that follow in this section, all three generators have been configured to use 
only lowest-order 2 —> 2 hard scattering matrix elements, so that any differences are due to MC 
family differences in parton shower algorithms, and matrix element scale choices. 

More specific modelling variations within generator families are considered in Sections |4.4| 
and|4.5| using only Pythia and only Sherpa respectively. 


4.2 Performance of default QCD-aware labelling 


In Figures 1^ and 1^ we show the pj resolution, Apj/p 


— /"Jet 


T= [Pt 


„ label 
Pt 


1 ct 

/ Pt and AR measure 

between the particle jets and their assigned labels in the incIusive-QCD-jet process. The light 
parton measures are shown for the photon+jet hard process type in Figure]^ the less interesting 
heavy label performance being relegated to Appendix A A kj measure has been used in the 
construction of the QCD-aware labelling jets, but the particle jets are anti-fcx as is standard for 
the LHC experiments; the effect of this mismatch will be investigated in Section |43| on page 13 


The effect of ME scale dependence on the plot normalisation has been largely eliminated by 
rescaling all histograms to correspond to the Pythia 8 cross-section calculation. 

The main point of note in these plots is general consistency between generators, and good- 
quality kinematic matching of the labels to the particle jets. The consistency is not perfect - for 
example Pythia events feature significantly more bottom-labelled jets than either of the other 
generators in the jets process, and Herwig++ produces many more gluon-labelled jets in the 
7 -|- jet event type - but otherwise these resolutions are very compatible both with each other 
and with the expectations of approximate parton-jet duality. Where shape differences are seen, 
e.g. the Herwig++ high tail for light-quark-labelled Apj/ pj in jet events, they are significantly 
suppressed relative to the peak of the distribution; addition of a pj resolution window cut to the 
labelling algorithm could remove the few anomalous labels assigned from long tails such as this. 
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Figure 2: Light tagging performance/comparisons: inclusive jet events, with gluon-labelled jets 
on the top row and light-quark-labelled jets on the bottom row. Particle jets clustered 
using the anti-^T’ algorithm, labelling parton jets with the kj algorithm. 
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Figure 3: Heavy tagging performance/comparisons; inclusive jet events, with bottom-labelled 
jets on the top row and charm-labelled jets on the bottom row. Particle jets clustered 
using the anti-^T’ algorithm, labelling parton jets with the kj algorithm. 
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Figure 4: Light tagging performance/comparisons: 7 + jet events, with gluon-labelled jets on the 
top row, light-quark-labelled jets in the middle row, and isolated prompt photons on 
the bottom row. Particle jets clustered using the anti-kj algorithm, labelling parton jets 
with the kj algorithm. The one-sided pj resolution distribution for the photon-labelled 
jet is by construction as described in the text. 
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Scheme 

Generator 

Jets 

‘l/g 

7 -6 jet 
l/g ^Ig 

Max-px 

Pythia 8 

0.38 

17.2 

10.5 


Herwig++ 

0.33 

in 

4.8 


Sherpa 

0.55 

21.0 

9.6 

kj 

Pythia 8 

0.80 

10.4 

8.2 


Herwig++ 

1.17 

3.6 

4.6 


Sherpa 

0.85 

10.5 

7.5 

anti-kx 

Pythia 8 

0.79 

10.2 

8.3 


Herwig++ 

1.74 

3.2 

4.5 


Sherpa 

0.86 

10.2 

7.5 

Reclustered 

Pythia 8 

0.77 

10.1 

8.0 


Herwig++ 

1.36 

3.5 

4.8 


Sherpa 

0.83 

10.1 

7.3 


Table 1: Jet label ratios for the combined sample of leading and subleading jets constructed in 
inclusive jet and 7 + jet simulated events, for various MC generators. 


The one-sided Apx distribution for the 7 + jet samples is deserving of explanation. This feature 
is by construction: to be labelled as a photon, the labelling "parton" is the exact same final-state 
photon as will enter the particle-jet finding, because there is no parton that a photon can cluster 
with, without losing its flavour. Hence the total jet pj must be at least the same as the labelling 
photon: momentum can only be added, not subtracted, within the jet cone. By contrast, for quark 
or gluon jets there are non-perturbative processes such as hadronisation and colour-reconnection 
which can reduce the particle-jet energy below that of their labelling parton. It is likely that an 
asymmetry of similar size is convolved into the Apj/pj distributions for quark and gluon jets, 
but that their overall greater width from non-perturbative modelling dominates the roughly 
symmetric peak shape. 

4.2.1 Flavour label ratios 

The other feature of these plots immediately worth commenting on is how the total number of 
jets labelled as gluon or quark in these two samples compare to the cross-sections of the fixed- 
order subprocess matrix elements. We compute these ratios from the integral of each distribution 
for the two leading jets only, to ensure final-state quantities comparable to the 2 -particle matrix 
element final states, and are not biased by the propensities of the generators to produce different 
numbers of above-threshold jets. The summary ratios are presented in Table and we now 
discuss them for the two process types. 

Inclusive Jet sample: the Pythia 8 leading-order matrix elements had the following cross-sections: 
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^{gg gg) = 2-71 mb, a{qg qg) = 1.41 mb, a{qq qq) = 0.15 mb, (r{gg qq) = 
0.07 mb. Summing these according to their contributions to the rates of final state quark or 
gluon partons, a fixed-order quark/gluon jet ratio of 0.27 is expected. 

Comparing the normalisations of the simulated gluon and quark observables gives QCD- 
aware kj quark/gluon labelling ratios of between 0 . 8 - 1.2 for the three shower generators, 
with Herwig++ at 1.17 a clear outlier from the other two generators near 0.8. The equivalent 
range for the max-p^ labelling scheme is 0.33-0.55. There is hence a substantial different 
between the q/g ratios obtained from max-pj and QCD-aware schemes, the max-pj 
scheme giving label rates closer to the fixed-order expectation than QCD-aware - although 
this should not be overinterpreted as indicting (in)correctness of either scheme. 

The inverted q/g ratio from Herwig++ is due to an excess of quarks, clearly visible in the 
light-quark plots of Figure]^ on page|^ Requiring a jet-label pj match by cuts on the tails 
of Pt resolution brings the Herwig++ q/g ratio to 1.0 - closer to the others, but still not 
good agreement. 

7 + jet sample: applying the same methodology as for the inclusive jet ratios, the two tree- 
level subprocess cross-sections are (^iqg —> qy) = 650 nb and (r{qq —> gy) = 53 nb, 
corresponding to expected fixed-order photon/gluon and quark/gluon ratios of 13.2 and 
12 . 2 . 

The fully showered QCD-aware kj ratios are between 3.6-10.4 and 4.6-8.2 respectively, 
and the max-pj equivalents are 7.7-21.0 and 4.8-10.5. In both schemes, there is agreement 
between the rates for Pythia 8 and Sherpa which are at the high ends of the ratio ranges 
for both processes, but Herwig++ is highly discrepant. The much lower 7 /y and q/g rates 
for Herwig++ appear to be driven by its very high gluon rate, clearly visible in Figure]^ on 
page[^ there is this time no obvious matching cut on AR or pj resolution which could 
address this issue. 

We stress again that these predictions are subject to significant resummation corrections and 
hence do not identify the "right answer". The broad expectations from fixed order subprocess 
cross-sections are met by Sherpa and Pythia 8 , but Herwig++ significantly deviates from the other 
generators by producing unusually many quark jets in jet events, and gluon jets in 7 -|- jet events. 
We will return to these features later. 

4.3 Dependence on clustering distance measure 

In our introduction of the QCD-aware method, we motivated partonic clusterings by analogy to a 
"rewinding" of QCD evolution through gluon radiation (and the analogous evolution for photon 
emission). This was also the historical motivation for the flavour-blind "bland" kj clustering, but 
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kt label 
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c 

b 7 
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0.8 
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- 
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- 
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- 
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- 

b 

- 

- 
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- 
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7 

- 

- 

- 

- 
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/cf-reclustered label 



none 



c 

b 7 
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- 
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0.3 

- 

- 

g 

- 

53.0 
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- 

2.5 

33.8 

- 

- 
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5.0 

- 
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- 
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- 
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7 

- 

- 
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max-pT label 




none g 

‘i 

c 

b 7 
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- 1.1 

0.1 

- 

- 



- 53.5 

1.1 

- 

- 

rO 

‘i 

- 15.7 

20.5 

- 

- 


c 

- 1.5 

_ 

3.6 

_ 


b 

- 0.4 

- 

- 

2.1 - 


7 

- 

- 

- 
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Table 2: Correlation matrices for pairs of labelling schemes in Pythia inclusive jet events. Each 
entry shows the fraction of all jets (in percent) given a pair of labels by the schemes 
listed on the vertical and horizontal axes, q here denotes a light-quark label. Fractions 
less than 0.1% are replaced with a dash. 

it has since proven useful to use alternative measures, most notably the anti-^T’ distance which 
has no clear link to resummation. We here entertain the possibility that the anti-kj distance 
measure may prove to have useful properties despite its relative lack of a priori motivation. 

To reduce the potential for kinematic mismatch of kj and anti-fci’ jet shapes, we also consider 
a /cT’-based QCD-aware reclustering of final partons matched to anti-fc^ particle jets. In this 
approach, final partons are first ghost-associated IITSlI as part of the construction of anti-/c 7 ’ 
particle jets, then for each particle jet the collection of associated final partons is clustered using 
the QCD-aware kj measure ("fcx-reclustered"). The anti-^x particle jet then inherits the label of 
the kj final parton jet with the smallest AR separation to its axis. 

In Figure 1^ the Apj/pj performance measure is again shown for the three shower MC 
generators, for gluon- and quark-labelled jets, respectively. The AR performance comparisons 
are shown in Figure [10| in the appendix. Figure shows a direct comparison of these spectra 
across labelling schemes for Pythia inclusive jet events. Perhaps surprisingly, the differences in 
shape are small, even though anti-^x is expected to have a virtually meaningless cluster sequence. 
There are two explanations for this: first, the flavour combination rules mean that there is much 
less difference in the clustering sequence than would be the case in a normal flavour-blind 
kj vs. anti-/cx clustering comparison; and second, the resulting labels often depend not on the 
detailed order of clusterings but on whether an unpaired (anti)quark ever gets clustered into 
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Figure 5: Apj/P t performance comparisons for gluon- (left) and light quark-labelled jets (right) 
in inclusive jet events, showing kj labeled jets on the top row, anti-fci’ in the middle, 
and fci’-reclustered on the bottom. 
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Figure 6: AR (left) and Apx/ Pt (right) clustering performance comparisons for gluon (top) and 
light (bottom) labelled jets in Pythia inclusive jet events. 
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Table 3: Correlation matrices for pairs of labelling schemes in Herwig++ 7 + jef evenfs with MPI 
on (left) and MPI off (righf). Each entry shows the fraction of all jets (in percent) given a 
pair of labels by the schemes listed on the vertical and horizontal axes, q here denotes a 
light-quark label. Fractions less than 0.1% are replaced with a dash. 

an otherwise gluon-flavoured parton pseudojet. Clustering of gluons or photons on to a quark 
or lepton-flavoured pseudojet changes only kinematics, not the label, and the change to the 
kinematics is invariant of the order of the boson clustering - the effect of measure is hence largely 
relegated to clusterings near the jet radius and those which occur during the differing periods 
during which the pseudojet kinematics are stabilising. 

How the labels of individual particle jets differ between schemes is also of interest. For instance, 
the kj scheme may assign a jet a gluon label while the anti-fci’ scheme may label it as a quark. 
This is summarised in the label correlation matrices in Table The anti-fc^ and fcx-reclustered 
schemes are in the top row, while the bottom-left matrix compares QCD-aware kj with the 
max-px scheme, in which a particle jet inherits the label of the highest-px ghost-associated 
parton from any step of the QCD shower evolution. Fabelling algorithms similar to the max-px 
scheme have been used in performance studies at ATT AS IlISl . 

Table 1^ gives some indication of fhe origin of fhe difference between max-px and QCD-aware 
labelling for Herwig++ 7 -|- jet events: there is a very significant off-diagonal contribution in the 
q-'Y cell, i.e. jets which are identified as photons in the max-px scheme, but as quarks in the 
QCD-aware scheme. This implies that low-momentum quarks in the vicinity of the hard photon 
are "capturing" that hard object and stealing its photon label. The unanswered question is why 
only Herwig++ and not Pythia or Sherpa have such a significant population of low-momentum 
quark "pollution". This deserves further investigation which would not be appropriate here, 
both investigating the effects of various Herwig++ model features and a more nuanced treatment 
of q-'Y clustering in the QCD-aware algorithm. 

In the absence of strong empirical motivations to choose the anti-fcx or fcx-reclustered labelling 
schemes, the kj measure remains the most obvious choice due to its theoretical links to QCD 
(and QED) emission kinematics in the Sudakov regime, and because where scheme-dependent 
anomalies are seen, they appear to be more prevalent in the anti-fcx arid reclustered-fcx schemes. 
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4.4 Dependence on parton shower IR-cutofr and multi-parton interactions 

In this section we consider two possible systematic effects in the configuration of the MC 
generator supplying events for jet finding and labelling: the choice of parton shower cutoff scale, 
and the impact of multi-parton interactions. In both cases the process of gluon splitting to quarks 
is the most dangerous to the stability of the jet labelling, since a wide angle soft splitting may 
not be clustered back together (without a distance modification a la flavour-fcx) in this early 
clustering phase, leaving two lone quark labels free to contaminate other gluon pseudojets. 

4.4.1 Parton shower IR-cutoff 

In principle, the QCD-aware labelling technique can be passed a partonic final state at any 
stage of its evolution and the results should remain largely invariant, since the low-px evolution 
should be safely reversed in the clustering. To test this hypothesis we have run Pythia 8 with both 
a normal (0(1 GeV)) parton shower cutoff, and one in which that cutoff has been greatly raised 
to the jet Pt threshold of 30 GeV. The effect of this is seen in Figure]^ and acts both as a very 
conservative estimate to the algorithm's sensitivity to the IR region of shower evolution (where 
different MC generators may use different cutoff tunes) and an indication of the applicability of 
the algorithm to appropriately defined fixed-order simulations. 

From these plots, comparing the blue raised-cutoff line to the red Pythia 8 default configura¬ 
tion, it can be seen that the low-p^ splittings in the default configuration increase the rate of 
light-quark jet labels, while having little impact on the gluon-label rate. As well as in the overall 
normalisations, this effect can particularly be seen in the high "shoulder" in the bottom-right 
quark-label p^-resolution observable, which is most prominent for the red default setup. This 
suggests that the relative ease oi q/g contamination leads to "wrong" light-quark labels being 
derived from gluon splittings to light quarks in the high multiplicity of low-p^ shower branch¬ 
ings, and hints that an improved matching requirement might prefer to label with the highest-pr 
label jet close to the particle jet axis in order to reduce this "shoulder" of mismatched too-low-pj’ 
labels. 

4.4.2 Multi-parton interactions (MPI) 

Multi-parton scattering poses a potentially lethal threat to an algorithm starting from final state 
partons, because of the intrinsic assumption that at least a kj-hased clustering will be able to 
reverse the order of QGD splittings. Typically we are only interested (as far as possible) in the 
hardest partonic interaction, but MPI overlays the partonic final state of that interaction with 
those from secondary partonic scatters. As a result of this incoherent overlay of distinct partonic 
final states, the geometrically optimal partonic clusterings may be between partons evolved 
from different hard processes, which (at least in model terms) are unrelated. This is expected to 
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Scheme 

Generator 

Jets 

H/g 

7 -6 jet 
'y/g ^/g 

Max-px 

Pythia 8 

0.39 

15.4 

9.5 


Herwig++ 

0.33 

18.3 

11.4 


Sherpa 

0.57 

13.4 

7.0 

kj 

Pythia 8 

0.65 

11.8 

7.6 


Herwig++ 

0.68 

11.2 

8.0 


Sherpa 

0.73 

13.0 

7.0 

anti-A: 7 

Pythia 8 

0.65 

11.7 

7.6 


Herwig++ 

0.93 

11.0 

8.1 


Sherpa 

0.74 

12.9 

7.0 

Reclustered 

Pythia 8 

0.64 

11.5 

7.5 


Herwig++ 

0.80 

11.0 

8.2 


Sherpa 

0.73 

12.7 

6.9 


Table 4: Jet label ratios for the combined sample of leading and subleading jets constructed 
in inclusive jet and 7 + jet simulated events, for MC generators with MPI modelling 
disabled. 


be particularly problematic for MPI quarks, since unlike gluons (or photons) they can lead to a 
reassignment of "true" gluon jet flavour labels. 

All the performance plots shown so far have had MPI modelling enabled, and in the label 
ratio discrepancies seen in Section 4.2.1 on page 12 - particularly for Herwig++ with respect 
to the other generators - there are hints that MPI overlay could be responsible. The gg —> gg 
process dominates the low-x MPI processesbut shower evolution of the MPI partons would be 
expected to produce some unpaired gluon splittings with a resulting effect. In Figure]^ on the 
following page we also compare Pythia 8 with and without MPI modelling enabled, to gauge 
the magnitude of the effect. 

It is clear that the green no-MPI configuration gives substantially better AR and pj agreement 
between the particle jets and their labels, as well as slightly reducing the overall normalisation - 
both effects are seen in both the gluon and quark distributions. Resolution improvements were 
also seen, in the unshown charm and bottom performance measures. This normalisation change, 
however, does not relate to an increased rate of unlabelled jets, but just the lower total number 
of particle jets in a no-MPI generator configuration (this was checked separately and is not 
evident from the displayed plots). While the removal of MPI produces narrower distributions 
than default in all cases, the largest effect is again seen on the upper side of the light quark pj- 
resolution observable, where adding soft MPI emissions enhances the pr-mismatched "shoulder" 
to the same extent as soft shower splittings did in the previous section. 

In Table 1^ we show the same jet label ratios as computed before, but now with MPI disabled 
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Figure 7: Light tagging dependence on event simulation systematics: inclusive jet events, with 
gluon-labelled jets on the top row, and light-quark-labelled jets on the bottom. 

for all three generators. The difference is striking: the PythiaS and Sherpa q/g ratios in jet events 
are little changed, but Herwig++'s "inverted" ratio is now shifted to agree with the others; and 
in 7 -|- jet events the high Herwig++ gluon rate is now gone, bringing its ratios in line with the 
other generators (and with very stable, cross-generator values in the QCD-aware scheme). MPI 
is clearly a very significant problem for post-fragmentation truth-jet labelling algorithms to 
address. 

4.5 Dependence on higher-order ME parton production 

In this section we make a final study of the robustness of the QCD-aware labelling scheme, 
considering variations in the maximum number of matrix element final-state partons for QCD jet 
events. This study was performed using the Sherpa event generator, configured in three separate 
runs to use a lowest-order 2—^2 QCD scattering matrix element as in the previous studies, as 
well as higher-order tree-level 2 —> 3 and 2 —> 4 MBs. 

The dominance of the lowest-order hard process configuration is illustrated by the stability of 
the cross-section which rose only by 0.1% from the 2—^2 value of 3.073 mb with the addition 
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Figure 8: Jet labelling performance dependence on maximum number of ME partons in jet 
events for gluon- (left) and light quark-labelled jets (right). The top row shows ah jets 
inclusively, the bottom only for the 3rd jet. 


21 









































ME 


Gluon frac. 

Light quark frac. 

Light parton frac. 

Unlabelled frac. 

2^2 

1.00 

62.7% 

27.0% 

89.6% 

2.3% 

2^3 

1.59 

56.4% 

31.4% 

88.3% 

2.9% 

2^4 

1.79 

58.3% 

31.9% 

90.2% 

2.6% 


Table 5: Comparison of 3rd jet rates and flavour label tractions between Sherpa calculation with 
three different maximal ME final-state multiplicities. 


of diagrams with up to two extra final-state partons. The inclusive jet plots in Figure]^ on the 
previous page show a larger effect, since the total number of jets to pass the 25 GeV analysis pj 
cut increased by 4.5% and 5.7% over the 2-parton configuration for the 3- and 4-parton samples 
respectively. But the stability of the performance measures is still notable. 

We see larger differences if we focus on the 3rd-hardest jet in the event, which in the 2 —> 2 ME 
configuration should virtually never directly correspond to an ME parton. The usual labelling 
performance measures are shown for this in the bottom row of Figure]^ on the preceding page. It 
is not simple to interpret these plots because the dominant effect is the change in normalisation 
due to the increased total number of 3rd jets as the matrix elements include more hard corrections: 
the increases are 59.1% and 79.3% above the 2 —> 2 configuration for the 3- and 4-parton MEs 
respectively. If we instead look at ratios of light quark and gluon labels to the total, shown in 
Table then stability is again evident: gluon labels account for 58-63% and light quarks for 
27-32% of 3rd jets despite the large normalisation changes. The total number of jets labelled 
as either gluon or light quark remains between 88% and 90% of the total, and the fraction of 
unlabelled jets is also stable between 2-3%. No asymmetry or change of distribution widths is 
observed with the changes of ME multiplicity. 

We note for clarity that it is not the case that a perfect labelling algorithm would give perfectly 
stable results in these tests - after all, the physics is being improved with each extra parton, 
changing the jet kinematics and potentially the flavour balance. But it does suggest both algorith¬ 
mic robustness and that the light-flavour weighting (it not the kinematics) of the Sherpa parton 
shower splitting functions is well-matched to the explicit ME calculations. 


5 Conclusions and proposals 

Just as there is no absolute definition of what constitutes a hadronic jet, there is no absolute way 
to assign a flavour label to it. But, as for jet finding, there are differences in the algorithms which 
are used as operational definitions, and not all algorithms are equal. 

In this spirit, we have described and characterised the performance of the QCD-aware algo¬ 
rithm for truth jet partonic labelling. This algorithm is based on restricting flavour combinations 
to those permitted within QCD and QED, and on final state partonic inputs defined in the lab 
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frame. It offers a theory-motivated labelling approach portable between all the major families of 
parton shower event generators, and shows fairly low sensitivity to calculation artefacts such as 
parton evolution cutoffs and ME order. Comparable labelling performance was seen between 
the generator families, across a range of hard process types with multi-parton interactions (MPI) 
disabled: the ratios of jet label rates from the three generators were in good agreement with each 
other, and the dominant labels agreed with the prediction from fixed-order cross-sections. A 
lack of strong dependence on distance measure was observed, due to the constraining effects 
of the flavour combination rules. Hence, while "just another algorithm", we contest that the 
QCD-aware approach is better theoretically motivated than existing approaches and portable 
between MC shower generators in the absence of MPI, e.g. in e^e~ events. 

However, the labelling performance is affected by MPI, which introduces a "shoulder" struc¬ 
ture several times higher than the no-MPI rate into the tail of the labelling jet pj resolution, and 
induces dramatic changes in the jet label ratios computed for Herwig++. This is a problem which 
must be addressed for use of this algorithm in fully-dressed hadron collider event simulations, 
and studies are underway to improve the robustness to MPI contamination. The exact reason 
for Herwig++'s extreme labelling susceptibility to MPI effects is not yet known. Changing the 
jet-label matching criteria to reject matches from label partons much softer than the particle jet 
will address some issues, and it is possible that suppressing flavour-changing clusterings with 
very soft quarks - as in the flavour-fcx clustering algorithm - may be an effective MPI rejection 
heuristic. 

The code implementation of this algorithm, available in the Fastjet contrib repository as 
QCDAware, allows for flexible "hybrid" approaches such as QCD-aware kj reclustering of labelling 
partons selected by association to anti-/c 7 particle jets, for increased consistency with LHC 
experiment procedures. 

It remains to be seen whether there is substance to hints that the labelling results may slightly 
overestimate the rate of quark jets. This is quite conceivable since pseudojet quark labels, once 
acquired, are harder to lose than gluon labels are: overlay of quarks from MPI can easily switch 
the label of an aligned hard-process gluon jet. This is a clear area for further investigation, in 
which the most obvious step is to introduce flavour-/c 7 style extra weighting tor quark & gluon 
kinematics, as well as including the effects of different colour and EM charges, and the relative 
(running) strengths of the QED and QCD couplings. These extensions provide a clear motivation 
to focus on the kj measure as the canonical distance choice for QCD-aware labelling. 

Finally, we note that the inclusion of leptons and photons in the QCD-aware combination rules 
provides an attractive way to define truth-level dressed leptons and isolated photons in addition 
to jets, without overlaps or false distinctions between e.g. hard and "fragmentation" photons. 
Thus it may otter an appealing alternative truth object definition scheme to those already on the 
market KT9|. There is perhaps also the possibility to extend the QCD-aware scheme with further 
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processes such as EW resonance decays which (like for partons) may not be reliably encoded 
in shower generator records: e.g. arid Wh —^ t clusterings could be added with 

appropriate clustering weights. However, as the kinematics ot these processes are non-Sudakov, 
there is no guarantee that they would work optimally, or even at all. 


Acknowledgements 

Our thanks to Donatas Zaripovas for his early work on testing the behaviour ot the QCD-aware 
algorithm, to Gavin Salam for clarifying our questions about the impact and necessity of flavour- 
kj distance measure modifications, and to Jesse Thaler for feedback on a draft version of this 
paper. And of course to the Fastjet team for providing such a useful tool and their "contrib" 
repository for collecting community contributions such as the code described here. 

References 

[1] S. Frixione, P. Nason, and C. Oleari, Matching NLO QCD computations with Parton Shower 
simulations: the POWHEG method, JHEP 0711 (2007) 070, arXiv: 0709.2092 [hep-ph], 

[2] T. Gleisberg, S. Hoeche, F. Krauss, M. Schonherr, S. Schumaim, et al.. Event generation with 
SHERPA 1.1, JHEP 0902 (2009) 007, arXiv: 0811.4622 [hep-ph]. 

[3] S. Schumarm and F. Krauss, A parton shower algorithm based on Catani-Seymour dipole 
factorisation, JHEP 0803 (2008) 038, arXiv: 0709.1027 [hep-ph], 

[4] T. Sjostrand, S. Mreima, and P. Skands, PYTHIA 6.4 physics and manual, JHEP 05 (2006) 026, 
arXiv:hep-ph/0603175. 

[5] T. Sjostrand, S. Mreima, and P. Z. Skands, A Brief Introduction to PYTHIA 8.1, 
Comput.Phys.Commun. 178 (2008) 852-867, arXiv: 0710.3820 [hep-ph], 

[6] T. Sjostrand, S. Ask, J. R. Christiansen, R. Corke, N. Desai, et al.. An Introduction to PYTHIA 
8.2, Comput.Phys.Commun. 191 (2015) 159-177, arXiv: 1410.3012 [hep-ph], 

[7] G. Corcella et al. JHEP 01 (2001) 010, hep-ph/0210213. 

[8] M. Bahr, S. Gieseke, M. Gigg, D. Grellscheid, K. Hamilton, et al., Herwig++ Physics and 
Manual, Eur.Phys.J. C58 (2008) 639-707, arXiv: 0803.0883 [hep-ph], 

[9] A. Banfi, G. P. Salam, and G. Zanderighi, Infrared safe definition of jet flavor, Eur.Phys.J. C47 
(2006) 113-124, arXiv:hep-ph/0601139 [hep-ph], 


24 




[10] G. P. Salam, Towards Jetography, Eur.PhysJ. C67 (2010) 637-686, arXiv: 0906.1833 
[hep-ph], 

[11] M. Cacciari, G. P Salam, and G. Soyez, Fastjet User Manual, Eur.PhysJ. C72 (2012) 1896, 
arXiv:1111.6097 [hep-ph], 

[12] D. E. Soper and M. Spannowsky, Finding physics signals with shower deconstruction, Phys.Rev. 
D84 (2011) 074002, arXiv: 1102.3480 [hep-ph]. 

[13] M. Dobbs and J. B. Hansen, The HepMC C++ Monte Carlo event record for High Energy Physics, 
Gomput.Phys.Gommun. 134 (2001) 41-46, 

[14] A. Buckley, J. Butterworth, L. Lonnblad, D. Grellscheid, H. Hoeth, J. Monk, H. Schulz, and 
E. Siegert, Rivet user manual, Gomput. Phys. Gommun. 184 (2013) 2803-2819 , 

arXiv:1003.0694 [hep-ph], 

[15] M. Cacciari and G. P. Salam, Pileup subtraction using jet areas, Phys.Lett. B659 (2008) 119-126 , 
arXiv:0707.1378 [hep-ph], 

[16] D. Adams, A. Arce, L. Asquith, M. Backovic, T. Barillari, et al.. Towards an Understanding of 
the Correlations in Jet Substructure, arXiv: 1504.00679 [hep-ph]. 

[17] A. J. Larkoski, J. Thaler, and W. J. Waalewijn, Gaining (Mutual) Information about 
Quark/Gluon Discrimination, JHEP 1411 (2014) 129, arXiv: 1408.3122 [hep-ph], 

[18] ATLAS Collaboration, Jet energy measurement with the ATLAS detector in proton-proton 
collisions at ^/s = 7 TeV, Eur.Phys.J. C73 (2013) no. 3, 2304, arXiv: 1112.6426 [hep-ex], 

[19] ATLAS Collaboration, Proposal for truth particle observable definitions. Tech. Rep. 
ATL-PHYS-PUB-2015-013, CERN, Geneva, May, 2015. 

https://cds.cern.ch/record/1700541. 


25 





arbitrary arbitrary 


A Further performance plots 



- PythiaS 

- Herwig++ 

- Sherpa 


0 0.05 0.1 0.15 0.2 

AR 




-1 -0.5 o 0.5 1 

Pj resolution 


Figure 9: Heavy tagging performance/comparisons: 7 + jet events, with bottom-labelled jets on 
the top row, and charm-labelled jets on the bottom row. 
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Figure 10: AR performance comparisons for gluon- (left) and light quark-labelled jets (right) in 
inclusive jet events, showing kj labeled jets on the top row, anti-^T’ in the middle, and 
fcx-reclustered on the bottom. 
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Figure 11: Jet labelling dependence on maximum number of ME partons in jet events for gluon- 
(left) and light quark-labelled jets (right). The top row shows all jets inclusively, the 
bottom only tor the 3rd jet. 
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